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mth Differential Gene Expression of the TACSTD2 Gene 
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Rapid postnatal growth is associated with increased risk of 
childhood adiposity. The aim of this study was to establish 
whether this pathway is mediated by altered DNA methylation 
and gene expression. Two distinct cohorts, one preterm {n = 121) 
and one term bom (n = 6,990), were studied. Exploratory analyses 
were performed using microarrays to identify differentially ex- 
pressed genes in whole blood from children defined as "slow" 
(n = 10) compared with "rapid" (n = 10) postnatal (term to 12 
weeks corrected age) growers. Methylation within the identified 
TACSTD2 gene was measured in both cohorts, and rs61779296 
genotype was determined by Pyrosequencing or imputation and 
analyzed in relation to body composition at 9-15 years of age. In 
cohort 1, TACSTD2 expression was inversely correlated with 
methylation (P = 0.016), and both measures were associated with 
fat mass (expression, P = 0.049; methylation, P = 0.037). Although 
associated with gene expression (cohort I, P = 0.008) and meth- 
ylation (cohort 1, P = 2.98 X 10"^^; cohort 2, P = 3.43 X 10"^^), 
rs6 1779296 was not associated with postnatal growth or fat mass 
in either cohort following multiple regression analysis. Hence, the 
lack of association between fat mass and a methylation proxy 
SNP suggests that reverse causation or confounding may explain 
the initial association between fat mass and gene regulation. Non- 
causal methylation patterns may still be useful predictors of later 
adiposity. Diabetes 61:391-400, 2012 




Both nutritional status and grov^h in early postnatal 
life have been shown to affect neurodevelopment 
and metabolic health into adulthood. Periods of 
rapid (or catch-up) grovv1:h in infancy are asso- 
ciated with adverse metabolic consequences in later life 
and are thought to lead to a subsequent increase in the risk 
of obesity and cardiovascular disease and the development 
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of insulin resistance (1-3). Faster postnatal weight gain has 
been associated with subsequent adiposity at age 10 years 
in a prospective study of children mainly bom at term (4,5) 
and a separate study of obese persons aged 5-22 years (6). 

In the context of the preterm infant, rapid postnatal 
growth may be beneficial for neurodevelopment (7). A 
study of early postnatal and late infancy growth in a cohort 
of preterm infants showed that early weight gain from 
birth to 3 months, and to a lesser extent from 3 months to 1 
year, was associated with a higher percentage of body fat, 
a greater abdominal pattern of fat distribution, and higher 
BMI SD scores at 19 years of age (8). Early growth and 
postnatal diet were also shown to have an effect on insulin 
sensitivity (9), indices of cardiovascular function (10), and 
other metabolic outcomes (11) in a preterm cohort aged 
13-16 years. Evidence therefore shows that early growth 
in preterm children is associated with long-term influences 
on body composition. 

The mechanisms underlying the association between 
early growth and later body composition are not yet clear. 
One potential mechanism explaining how early life expo- 
sures are memorized and persist throughout the life course 
is epigenetic modiflcations, whereby mitotically heritable 
changes in gene expression and chromatin organization 
occur independently of changes to the DNA sequence 
itself The most widely examined epigenetic modification 
is DNA methylation where cytosine bases, primarily lo- 
cated next to guanine residues, are covalently bound at the 
carbon 5 position to a methyl group. A role for epigenetic 
processes in the programming of body composition has 
been proposed (12-14) and is supported by recent evidence 
from studies of DNA methylation in infants in relation to 
body composition in childhood. Two recent studies have 
reported associations between methylation in DNA extracted 
from cord blood (15) and umbilical cord tissue (16) and 
prepubertal adiposity in children. 

The aim of the current study was to investigate whether 
growth during early postnatal life is associated with vari- 
ability in gene expression and whether such variation is 
associated with later body composition in childhood and 
mediated by epigenetic mechanisms. Furthermore, in 
recognition of the strong potential in epigenetic epidemi- 
ological studies for both confounding (where a factor not 
on the causal pathway is associated with both exposure 
and outcome) and reverse causation (where phenotype 
impacts on the epigenetic marks being measured), addi- 
tional strategies to strengthen causal inference were re- 
quired. Hence, a "genetical epigenomics" approach (17), 
whereby a single nucleotide polymorphism (SNP) corre- 
lating highly with local DNA methylation levels and which 
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cannot be subject to confounding and reverse causation 
was used as a proxy to interrogate causal relationships. 
Such approaches offer insight into causality, allowing more 
robust conclusions to be drawn about the mediating role of 
epigenetic mechanisms that are postulated to link postnatal 
growth with childhood adiposity. 

RESEARCH DESIGN AND METHODS 

A time line depicting the age from which measurements and samples were used 
in both cohorts is shown in Supplementary Fig. 1, available in the Supplementary 
Data. The analysis workflow for both cohorts and the number of samples used 
at each stage are shown in Supplementary Fig. 2A and B. 
Cohort 1 (Newcastle Preterm Birth Growth Study). Healthy preterm 
infants were recruited from the Special Care Baby Unit, Royal Victoria In- 
firmary (Newcastle upon Tyne, U.K.) between September 1993 and November 
1999 (18,19). Anthropometric, biochemical, and nutrient markers were taken 
at birth, at intervals up to 18 months corrected age, and at —11 years of age. 
Whole blood (EDTA-Vacutainer and PAXgene Blood RNA tube; BD, Franklin 
Lakes, NJ) and saliva (Oragene • DNA; DNA Genotek, Kanata, Canada) samples 
were collected at age —11 years for DNA and RNA analysis. QIAamp DNA 
Blood Midi kits (Qiagen, West Sussex, U.K.) and Oragene DNA Purifier kits 
(DNA Genotek) were used for DNA extraction from blood and saliva samples, 
respectively. Total RNA was extracted using PAXgene Blood RNA System kits 
(Qiagen). Ethics approval for this study was obtained from the Joint Ethics 
Committee of Newcastle and North Tyneside. 

Cohort 2 (Avon Longitudinal Study of Parents and Children). Pregnant 
women from the Avon area in the southwest of England whose expected dates 
of delivery were between April 1991 and December 1992 were recruited. The 
Avon Longitudinal Study of Parents and Children (ALSPAC) is a prospective 
study, and the extensive data collected during pregnancy and throughout 
childhood have previously been described in detail (http://www.alspac.bristol. 
ac.uk) (20). Anthropometric data taken at —9 and —15 years of age along with 
DNA extracted from cord blood buffy coats and whole blood samples col- 
lected at —7 years of age (21) were provided for use in this study. All DNA 
samples were prepared using standard phenol-chloroform extraction methods. 
Ethics approval was obtained from the ALSPAC Law and Ethics Committee 
and from local research ethics committees. 

Definition of postnatal growth and body composition. Postnatal growth 
was defined as the difference in weight standardized score (SDS) between term 
and 12 weeks gestationally corrected age (i.e., 12 weeks from the expected term 
date) (range 10-16) in cohort 1 or between birth and 8 weeks of age (range 5- 
12) in cohort 2. These differing time points could not be avoided, as they were 
reliant on available data within these distinct cohorts. Weight SDS, adjusted 
for sex and age, was calculated using the British 1990 growth reference 
standards (22). Growth rate was then expressed and analyzed as both a 
continuous and dichotomized variable, in which children were split into slow 
and rapid growers based on the median weight change SDS across each study 
subgroup. 

Body composition in childhood (cohort 1,-11 years of age; cohort 2, —9 
and —15 years of age) was assessed by height (Harpenden stadiometer; Holtain, 
Crosswell, U.K.), weight (Tanita body fat analyzer; Tanita), BMl (kilograms per 
meters squared), fat mass, lean mass, and total mass (whole-body dual-energy 



X-ray absorptiometry) (cohort 1: software version 11, GE Lunar iDXA ma- 
chine, GE Healthcare UK & Ireland; cohort 2: Prodigy scanner. Lunar Radiation, 
Madison, Wl), using standard protocols (18-20). Data were expressed in abso- 
lute terms and as SDS (22). 

Gene expression array analysis. For cohort 1, RNA samples, extracted from 
whole blood from 20 individuals and selected on the basis of their rapid or slow 
growth and availability of RNA, were sent to ServiceXS (Leiden, the Nether- 
lands) for gene expression analysis. Individuals used for expression analysis 
were matched for sex, and all were of white European ethnicity. Globin re- 
duction was performed using a GeneChip Globin-Reduction kit (PreAnalytiX/ 
Affymetrix). Human NuGO-Hsla520180 GeneChip CEL files were normalized in 
BioConductor using the GCRMA package. Log-fold change was calculated 
using the RankProd package (percentage of false positive predictions [pfp] < 
0.05 with 100 permutations of the class labels) to identify genes with differential 
expression. Annotations were attached to probe sets from the nugohsla520180. 
db library (http://www.bioconductor.org/). Raw and normalized data from the 
experiment was deposited in the Gene Expression Omnibus (http://www.ncbi. 
nkn.nih.gov/geo/) with accession no. GSE22013. Real-time RT-PCR was per- 
formed with Superscript 111 First Strand Synthesis System (Invitrogen) for 
validation of microarray data on the candidate gene selected for follow-up 
(primers in Table 1). 

In silico analysis of gene expression data. After identification of differ- 
entially expressed loci, Gene2Promoter (Genomatix, Munich, Germany) was 
used to identify potential promoter regions of target genes. Promoter modules 
and CpG islands were identified using Modellnspector (version 5.4; Genomatix) 
and CpG Island Explorer, version 2.0 (23), respectively. 
Methylation analysis by gene-specific Pyrosequencing. For both cohorts, 
methylation of selected cytosine residues was quantified using Pyrosequencing. 
For cohort 1, all individuals with adequate blood-extracted DNA samples and 
postnatal growth data (n = 94) were included in the analysis and, where 
available, methylation was also quantified in DNA extracted from saliva samples 
(n = 68) for comparison. For cohort 2, DNA samples derived from cord blood 
(n = 148) and age 7 years whole blood (n = 161) with postnatal growth data 
were analyzed. Bisulphite conversion of 1 and 2 fxg DNA was carried out using 
the EZ DNA Methylation kit (Zymo Research, Irvine, CA) (modification to CT 
conversion reagent-DNA mix incubation: 20 cycles of 95°C for 30 s; 50°C for 
15 min) and the EZ DNA Methylation-Gold kit (Zymo Research) (following 
the manufacturer's instructions), respectively. To determine the percentage of 
methylation at individual CpG sites, quantitative bisulphite Pyrosequencing 
(Qiagen) with Pyro Q-CpG software (version 1.0.6.) was used (primers in Table 1). 
Pyrosequencing PCR cycling conditions were as follows: denaturation at 95°C 
for 15 min and then fifty cycles of 95°C for 15 s, 50°C for 30 s, and 72°C for 15 s 
and extension at 72°C for 5 min followed by 4°C hold. 
Quality-control assessment of methylation analysis. Calibration curves 
were constructed to assess whether there was any PCR amplification bias 
between unmethylated and methylated sequences of TACSTD2. PCR reactions 
were run in duplicate. Pyrosequencing success rates in cohort 1 were 97 and 
94% for DNA that was extracted from blood and saliva, respectively and, in 
cohort 2, 94% for DNA extracted from cord blood and 97% from year 7 blood. 
Genotype analysis 

Cohort 1. SNPs located within the promoter region of the target gene were 
analyzed by Pyrosequencing (Qiagen). All available DNA samples were used 
(n = 122). Pyrosequencing PCR reactions (primers in Table 1) on genomic 
DNA, cycling conditions, and sequencing were performed as described above 



TABLE 1 



List of primers 



Primer 


5' ^ 3' 


Annealing 
temperature 


Amplicon size 


TACSTD2 real-time forward 


GAATCCATTGCGACATTGTG 


60 


208 




GAPDH real-time forward 


TGAAGGTCGGAGTCAACGGATTTG 


60 


128 


GAPDH real-time reverse 








TACSTD2 control forward 


TGGTGTTAGGATGAGATAA 


50 


472 


TACSTD2 control reverse 








TACTSD2 pyro forward 


AGGGAGTGAGAGAAATTA 


50 


345 


TACTSD2 pyro reverse 








TACSTD2 pyro sequencing 


TGGGGTTGGGAAAGA 






TACSTD2 genJg/tttHKKKKM 








TACSTD2 genotype reverse 


CTCCCGGCAGAAAGTCTAAAGAG 






TACSTD2 genotype sequencing 


GACTATCAGACCCGTTTTA 
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with automated genotype calling carried out using PyroMark MD (Qiagen). 
Genotype success rate was 98%. 

Cohort 2. Extensive genome-wide SNP data across large subgroups of the 
ALSPAC cohort have previously been generated and were available for analysis 
(http://www.alspac.bristol.ac.uk). Genotypes were imputed across —10,000 
individuals using lUumina HumanHap550 and 1000 Genomes deep sequence 
data as reference sets. Imputations were carried out in Markov Chain Hap- 
lotyping software (version 1.0.16; MACH) (http://www.sph.umich.edu/csg/ 
abecasis/MACH/). SNPs with imputation quality scores/r^ values <0.8 were 
rejected from further analysis. Of the 10,000 individuals included in the im- 
putation analysis, 6,990 had data relating to postnatal growth and, hence, were 
included in subsequent analyses performed within this study. 
Statistical analysis. The two cohorts were analyzed using the same protocol. 
(See Supplementary Fig. 3 for an overview of relationships interrogated.) 
Associations between postnatal growth and TACSTD2 expression/methylation 
were assessed using nonparametric Mann- Whitney U and Spearman rank cor- 
relation tests, given the nonnormality of both exposure and outcome variables. 
Relationships between TACSTD2 expression/methylation and measures of body 
composition were initially explored using Spearman rank correlation followed 
by linear regression analysis to identify potential confounders. (Sex, age, height, 
and pubertal status at follow-up were considered.) For regression analyses, log 
transformations of the outcome measures were used where necessaiy. Associ- 
ations between TACSTD2 genotypes and postnatal growth and between geno- 
types and expression/methylation were assessed by and Kruskal-Wallis tests, 
respectively. Relationships between genotypes and measures of body com- 
position were also explored using Kruskal-Wallis tests followed by linear 
regression analysis as described above. Possible differences in baseline 
characteristics across study subgroups were tested by univariate analyses as 
appropriate. All statistical analyses were performed using STATA, version 10 
(StataCorp, College Station, TX). 



RESULTS 

Cohort characteristics. Two different cohorts were 
available for analysis: one of preterm children with follow- 
up data at a mean age of 11 years (cohort 1, n = 121) and 
the other composed of largely term-bom children with 
follow-up data at mean ages of 9 and 15 years (cohort 2, 
n = 6,990). Gene expression and methylation analyses 
were performed across smaller subgroups in which 
adequate RNA and DNA samples were available. These 
subgroups were representative of the larger cohorts 
(Supplementary Table 1). Baseline characteristics are pre- 
sented in Supplementary Table 1. Expected differences 
were observed in gestational age and birth weight between 
the two cohorts. Specifically, in cohort 1 the average ges- 
tation was 31 weeks and mean ± SD birth weight was 1.4 ± 
0.3 kg compared with mean gestation 40 weeks and mean 
birth weight 3.5 ± 0.6 in cohort 2. The median change in 
weight SDS between term and —12 weeks corrected age in 
cohort 1 was 0.09 (range -1.10 to 2.66). In cohort 2, the 
median change in weight SDS between birth and ~8 weeks 
of age was 0.06 (-3.64 to 8.25). These two postnatal growth 
distributions were comparable (Mann-Whitney U: z = 1.00, 
P = 0.316). 

Postnatal growth was associated with differential gene 
expression in infants born preterm. Genome-wide ex- 
pression analysis of RNA samples from 10 slow and 10 rapid 
postnatal growers (cohort 1-expression) revealed 989 loci 
that were differentially expressed (data available in the 
Gene Expression Omnibus [http://www.ncbi.nlm.nih.gov/ 
geo/], accession no. GSE22013). The 50 most differentially 
expressed loci (with a log-fold increase >1.8) were inves- 
tigated using in sHico approaches to define putative epige- 
neticaUy regulated genes. TACSTD2 was identified as the 
top hit, being the most differentially expressed locus (with a 
3.22 log-fold increase in slow compared with rapid growers) 
containing a CpG island within its promoter region. Four 
other loci showed greater differences in expression levels 
but did not contain a CpG island: DEFA4 and AP0BEC3B, 
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for which slow growers had higher expression levels com- 
pared with rapid growers, and HLADRB4 and IFI27, for 
which rapid growers had higher expression levels com- 
pared with slower growers. Table 2 presents associations 
between postnatal growth and TACSTD2 normalized 
expression levels. Validation of TACSTD2 expression 
measurement was successfully performed using RT-PCR 
(Spearman rank correlation between TACSTD2 array data 
and RT-PCR data: p = 0.64, P = 0.003). 
TACSTD2 gene expression was associated with 11- 
year fat mass in a preterm cohort. Childhood body 
composition measures across cohort 1 are available in 
Supplementary Table 2. There was weak evidence for a 
positive correlation between normalized TACSTD2 expres- 
sion levels and fat mass (Spearman p = 0.44, P = 0.061) but 
not for other indices of body composition (Supplementary 
Table 2). 

TACSTD2 promoter methylation patterns across 

childhood. Whole blood samples, taken at —11 years of 
age, were available for DNA methylation analysis from 94 
children born preterm (cohort 1-methylation). In addition, 
148 cord blood and 161 whole blood samples taken at 7 
years of age were available from cohort 2 (cohort 2- 
methylation) for validation analyses. Figure 1 illustrates 
the topography of the TACSTD2 gene region. Strong cor- 
relations in percentage methylation were demonstrated 
across the seven CpG sites investigated in three sources of 
DNA (average Spearman p: cohort 1-methylation 0.95, P = 
1.05e-39; cohort 2-cord blood 0.97, P = 4.17e-71; cohort 2- 
age 7 blood 0.95, P = 3.24e-61) (Supplementary Table 3). 
Hence, all subsequent analyses were performed using the 
mean methylation across all seven CpG sites. Mean meth- 
ylation levels derived from blood samples in cohort 1 were 
higher than but correlated with those estimated using DNA 
from available saliva samples (n = 68, Spearman p = 0.71, P = 
1.12e-ll) (Supplementary Table 4). In cohort 2, there was 
a strong positive correlation between mean methylation 
levels in cord blood and those in whole blood at age 7 years 
(n = 148, Spearman p = 0.63, P = 1.67e-17). Overall, meth- 
ylation levels at 7 years of age were slightly higher (—1.3%) 
than those measured in cord blood from the same children 
(Wilcoxon signed rank test for mean methylation - 1.78, P = 
0.076) (Supplementary Table 5). 

TACSTD2 promoter methylation was associated with 
postnatal growth and 11 -year fat mass in a preterm 
cohort. A negative correlation was demonstrated between 
mean methylation and TACSTD2 expression levels in cohort 
\ {n = 19, Spearman p = -0.55, P = 0.016). In concordance 
with the expression data, methylation levels were associ- 
ated with postnatal growth, with lower levels observed in 
the slow growers compared with rapid growers (Table 2). In 
addition, fat mass was negatively correlated with mean 
TACSTD2 methylation (Table 3). Multiple linear regression 
analyses were used to identify potential confounders. For 
this analysis, fat mass was log transformed. Height was 
identified as an independent predictor of log(fat mass). 
Upon adjustment, TACSTD2 methylation remained asso- 
ciated with fat mass (Table 4). 

Associations between TACSTD2 promoter methyla- 
tion, postnatal growth, and childhood fat mass were 
not replicated in a larger term cohort. Childhood body 
composition measures across cohort 2 are presented in 
Supplementary Table 6. TACSTD2 methylation at 7 years 
of age was not associated with postnatal growth in this 
largely term-bom cohort (Table 2). Fat mass at 15 years of 
age was positively correlated with DNA methylation in cord 
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FIG. 1. Topography of the TACSTD2 gene. DNA methylation analysis of TACSTD2 focused on seven CpG sites within the promoter region of the 
gene, located on chromosome (Chr)l. Positions are annotated with respect to the start codon. Two SNPs were identified at 162 and 158 base pairs 
from the closest CpG site analyzed (rs61779295 and rs61779296). Information was derived from UCSC Genome Browser on Human Mar. 2006 
(NCBI36/hgl8) Assembly. 



blood (Table 3). A similar pattern was also demonstrated 
between fat mass at 15 years and methylation measured at 
7 years (Table 3). Associations were not evident between 
methylation in either DNA sources or fat mass at the age 
of 9 years (Table 3). The positive correlations between 
methylation and fat mass at 15 years of age observed in 
this term cohort were inconsistent with those observed in 
the preterm cohort at age 11 years. Furthermore, multiple 
regression analyses indicated that these initial associations 
were not independent of confounding factors, namely, 
height and sex (Table 4). 

TACSTD2 promoter SNPs were associated with gene 
regulation. Two SNPs (rs61779295 and rs61779296), situ- 
ated 158 and 162 base pairs upstream of the first CpG site, 
respectively, were selected for analysis (Fig. 1). In cohort 1, 
these SNPs were genotyped directly in 122 children (cohort 



1-genotype). Neither SNP had been genotyped previously 
within cohort 2. Hence, imputations using existing Ulumina 
HumanHap550 quad chip and 1000 Genomes data were 
performed. The resulting quality scores and values for both 
SNP genotypes were extremely robust at 0.994 and 0.998, 
respectively, enabling the imputed data to be used for sub- 
sequent investigations (cohort 2-genotype). Both SNPs con- 
formed to Hardy-Weinberg equilibrium (P > 0.05) and were 
found to be in perfect linkage disequilibrium in both cohorts 
iP' = 7^ = 1). Hence, only rs61779296 was used for sub- 
sequent analyses. Robust evidence of association between 
genotype and TACSTD2 gene expression and promoter 
methylation was demonstrated in both cohorts (Table 5). 
TACSTD2 genotype was not associated with postna- 
tal growth or fat mass in either cohort. No evidence of 
association between postnatal growth and rs61779296 was 



TABLE 3 

Association analysis between childhood fat mass and TACSTD2 genetic markers 









Correlation/comparison* 




N 


Fat mass (kg) 




P 


Cohort 1 (median age 12 years 
[interquartile range 3]) 


Expression 


20 




0.43 


0.061 


Mean methylation 


91 




-0.22 


0.037 


Rs61779296 genotype 


GG 


71 


11.30 (7.09-15.45) 






GA 


35 


12.89 (8.48-19.85) 


8.48 


0.014 


AA 


12 


17.14 (15.51-18.25) 






Cohort 2 (median age 10 years 
[interquartile range 0.3]) 


Cord blood: mean methylation 


131 




0.10 


0.241 


Year 7 blood: mean methylation 


144 




0.04 


0.612 


Rs61779296 genotype 


GG 


2,544 


7.02 (4.73-10.98) 






GA 


1,876 


7.44 (4.94-10.85) 


1.29 


0.526 


AA 


350 


7.41 (4.76-10.52) 






Cohort 2 (median age 15 years 
[interquartile range 0.3]) 


Cord blood: mean methylation 


111 




0.20 


0.040 


Year 7 blood: mean methylation 


121 




0.17 


0.068 


Rs61779296 genotype 


GG 


1,806 


13.57 (8.21-20.27) 






GA 


1,345 


13.52 (8.62-19.60) 


0.08 


0.962 


AA 


250 


13.56 (8.65-19.19) 







Data are median (25th-75th percentile) unless otherwise indicated. ^Correlations between fat mass and continuous markers (i.e., expression 
and methylation) were tested using nonparametric Spearman rank correlation tests, and associations between fat mass and categorical 
markers (i.e., genotype) were tested using nonparametric Kruskal-Wallis tests. 
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TABLE 4 

Multiple linear regression models for childhood fat mass (In) and TACSTD2 genetic markers 





Multiple linear regression model* 






Coefficient (95% CI) 


t 


P 




Cohort 1 (median age 12 years [interquartile range 3]) 


Fat mass vs. expression {n = 20) 


Expression 


0.13 (<0.01-0.25) 


2.11 


0.049 


0.199 


Fat mass vs. methylation (n = 90) 


Height 


0.02 (0.01-0.03) 


5.02 


2.762e-06 


0.270 


Mean methylation 


-0.01 (-0.01 to <0.01) 


-2.12 


0.037 




Fat mass vs. rs61779296 genotype (n = 117) 




GG 


Reference group 










AA 


0.26 (<0.01 to 0.53) 










Fat mass vs. cord blood: methylation (n = 111) 












Female 


0.77 (0.52-1.01) 


6.26 


8.018e-09 




Mean methylation 


<0.01 (<0.01 to 0.01) 


1.37 


0.173 




Fat mass vs. year 7 blood: methylation (n = 121) 


Male 


Reference group 








Female 


0.78 (0.55-1.00) 


6.76 


5.756e-10 




Height 


0.02 (0.01-0.04) 


3.07 


0.003 


0.30 


Mean methylation 


<0.01 (<0.01 to 0.01) 


0.76 


0.451 





Fat mass was transformed using the natural logarithm in both cohorts. Coefficients (95% CIs), t statistics, and P values are provided for 
independent covariants. values are provided for the overall model. Height measures were taken at -12 and 15 years of age for cohorts 1 and 
2, respectively. *Associations were explored using multiple linear regression models. 



observed in cohort 1-genotype or cohort 2-genotype (Table 
2). In concordance with expression and methylation data, 
fat mass demonstrated evidence of association with geno- 
type in cohort 1 (Table 3). However, after multiple regres- 
sion analysis, rs6 1779296 was not predictive of Iog(fat 
mass) after adjustment for height (Table 4). In addition, 
there were no associations between this genetic marker and 
fat mass at either 9 or 15 years of age in cohort 2 (Table 3). 



DISCUSSION 

The current study investigated the molecular mechanisms 
underpinning previously reported associations linking 
rapid postnatal growth with increased adiposity in child- 
hood (4-6,8). Gene expression analysis highlighted a novel, 
putative epigenetically regulated locus, TACSTD2, which 
demonstrated increased expression in preterm children who 
grew slowly compared with those who grew rapidly post- 
nataUy. Higher TACSTD2 expression levels were also asso- 
ciated with decreased promoter methylation at this locus 
and increased fat mass at age —11 years. These observa- 
tions were not consistent with the expected pattern of rapid 
postnatal growth leading to increased adiposity in later life, 
although this inverse association could be a feature specific 
to preterm infants who have been shown to have higher 
resting energy expenditure and thus be protected from 
obesity (24). 

The possible explanations for the observed, potentially 
causal, associations were probed in more detail. Regression 
analysis indicated that height explained much of the varia- 
tion in fat mass and may have acted as a confounding var- 
iable in the association between methylation and fat mass, 
although this association did not disappear completely 
after adjustment. Height and sex were also shown to be 



confounding variables in the association between methyla- 
tion at 7 years and fat mass at 15 years of age observed in 
the replication, albeit term, cohort. In addition, the initial 
univariate analyses were suggestive of a positive relation- 
ship between methylation and fat mass in cohort 2 — the 
opposite of the inverse correlation observed in cohort 1. 
Although the unexpected trend in cohort 1 might be related 
to distinct grov^ trajectories of preterm infants, such 
inconsistencies in observed effects could also be evidence 
of spurious associations. Furthermore, given that the sam- 
ples used in the expression analyses were also included 
in the methylation analysis, a false-positive effect would 
have been carried forward to the subsequent, albeit larger, 
methylation analysis. The potential for reverse causality, in 
which differences in childhood fat mass are responsible for 
changes in methylation, cannot be excluded from the find- 
ings observed in cohort 1, given that TACSTD2 methylation 
and childhood fat mass were measured at the same time 
point. Hence, the possibility remains that this finding could 
be spurious. 

The availability of both cord and age 7 blood samples in 
cohort 2, along with longitudinal body composition data, 
enabled further exploration of the causal pathways among 
postnatal grov\^h, DNA methylation, and later adiposity. 
Comparisons between DNA methylation in cord blood and 
later life may indicate whether observed differences in 
methylation detected in childhood result from poor post- 
natal grovs^h (or other environmental factors) or preexist 
in the infant. However, no robust associations were ob- 
served between postnatal grovs^h and methylation at birth 
or at age 7 years in this term cohort. Furthermore, analysis 
of TACSTD2 methylation in paired cord blood and whole 
blood DNA taken 7 years later showed little variation 
(1.3%). If growth were driving methylation changes during 
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postnatal life, one might anticipate the change over this 
time period to be considerably greater. 

Interest in the interplay between genetic variation and 
DNA methylation patterns has gained considerable mo- 
mentum (25-27). A large majority of currently identified 
methylation-infiuencing SNPs occur in cis regions, usually 
within 5 kb of the methylation site (25,28), a feature that 
permits their potential application as a proxy for methyla- 
tion measures. Such potential proxies within the TACSTD2 
promoter region were identified. Rs61779296 correlated 
strongly with methylation levels in the promoter region 
and was therefore used as a surrogate to explore the re- 
lationship between methylation at this locus and childhood 
fat mass in the larger sample set. This approach, based on 
the principles of the now widely used Mendelian random- 
ization method (29,30) and those similarly applied in the 
analysis of gene expression data (31,32), has been termed 
"genetical epigenomics" (17) and helps overcome the prob- 
lems of reverse causation and confounding commonly en- 
countered in (molecular) epidemiological studies. 

Multiple regression analysis of rs6 1779296 with post- 
natal growth and measures of fat mass demonstrated no 
evidence of association in either instance, despite this SNP 
showing clear association with both DNA methylation and 
expression levels. An instrumental variable analysis using 
the rs6 1779296 SNP data was undertaken to further probe 
the relationship between methylation and fat mass dem- 
onstrated in cohort 1 (33). Although this analysis was un- 
derpowered, preventing robust conclusions, it suggested 
that confounding may be responsible for the observed as- 
sociation. We conclude, therefore, that the scenario depic- 
ted in Fig. 2 is the most likely; i.e., the observed associations 
between postnatal growth, TACSTD2 methylation and ex- 
pression, and fat mass are explained largely by confound- 
ing. Much of the confounding could be explained by sex or 
height, although some residual confounding is likely. This 
correlation between fat mass and altered DNA methylation 
accords with other recent studies linking BMI with meth- 
ylation status in adults (34,35), where the direction of cau- 
sality and potential confounders remain undefined. 

Despite our conclusion that TACSTD2 methylation is 
unlikely to provide a causal link between rapid postnatal 
growth and altered adiposity, this locus is clearly different 
in a subset of children with increased fat mass and may 
have particular relevance in a preterm setting. The TACSTD2 
gene encodes a ceU-surface glycoprotein that is implicated in 
ceU proliferation (36). Increased expression of TACSTD2 
correlates with poor prognosis in several cancers in- 
cluding those of the oral cavity (37), prostate (38), and 
ovary (39) and liver metastases of colorectal cancer (40). 
Our observed association of increased expression with 
higher fat mass, while novel, is consistent with such adi- 
pocyte hyperplasia and hypertrophy, mechanisms that 
have also been implicated in differential fat accumulation 
in adults (41). 

The study findings focused on a single locus — an ap- 
proach that has inherent limitations. Epigenetic control of 
development is likely to occur at a network/pathway level, 
so the analysis of an isolated gene is unlikely to give a full 
picture (42). However, focusing on one locus enabled 
a thorough assessment of the potential causal pathway. 
Specifically, this study combined analyses of the exposure 
of postnatal growth, methylation, expression, and geno- 
type of TACSTD2 and body composition from the same 
individuals, providing a high level of insight into the mo- 
lecular pathways linking postnatal growth to later adiposity. 
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FIG. 2. Defining the causal pathway linlcing postnatal growth with fat mass. The schematic depicts the postulated relationship among postnatal 
growth, TACSTD2 methylation, TACSTD2 gene expression, and fat mass. The figure provides a framework with which to infer the direction of 
causality between exposure (postnatal growth), outcome (childhood adiposity), and intermediate phenotypes (DNA methylation and gene 
expression). A SNP in the gene of interest can be used to infer causality. A robust correlation was observed between rs61779196 and DNA 
methylation (bold arrow). This permitted the use of the SNP as a surrogate for methylation levels. Genotype is not influenced by confounding 
factors and cannot be influenced by reverse causation (where the outcome, in this instance adiposity, influences methylation). Lack of asso- 
ciation of the TACSTD2 SNP rs6 1779296 with fat mass provides evidence that DNA methylation is unlikely to have a causal influence on adi- 
posity. Confounding is the most likely explanation for the observed association between fat mass and DNA methylation (and expression) in the 
TACSTD2 gene. 



The same approach could be apphed to numerous other 
loci, and it is our intention to follow up in the same way 
as for other differentially expressed genes identified from 
the exploratory analyses. Although RNA and DNA were 
extracted from peripheral blood and not adipose tissue, 
the gene expression profiles, in terms of pathways per- 
turbed, have been shown to have considerable overlap 
(43). Replication in DNA and RNA extracted from adipose 
tissue would clearly be of interest; however, genotype will 
remain unchanged and the lack of association between 
genotype and fat mass would persist. A major strength of 
the current study is the use of a genetic proxy to probe 
causal relationships, and this genetical epigenomic ap- 
proach may be valuable in future studies of this type. 

The discovery cohort analyzed consisted of individuals 
bom preterm, whereas the replication cohort was largely 
composed of term-bom individuals. It is recognized that 
growth trajectories in these two groups differ markedly 
and that interpretation of confiated results could be mis- 
leading. Slight differences in the definition of postnatal 
growth between the cohorts could not be avoided, as these 
were reliant on available data points (cohort 1, term to 
term plus 12 weeks, and cohort 2, birth to 8 weeks). 
However, the distributions of postnatal growth did not 
differ statistically between the two cohorts and are 
therefore unlikely to have infiuenced the reported associ- 
ations or conclusions drawn regarding methylation and fat 
mass. 

In summary, postnatal growth and childhood fat mass 
were associated with altered TACSTD2 gene expression 
and promoter methylation. However, the lack of associa- 
tion between fat mass and a methylation proxy SNP in this 
gene suggests that confounding might explain the observed 
associations. However, the identification of a correlation 
between DNA methylation and childhood fat mass may 
have important predictive utility regardless of causality. The 
use of genetic proxies in epigenetic studies is advocated to 
strengthen causal inference. 
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